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Abstract 

We prove a result which says that when the transition matrix of a Markov 
chain has a certain structure, then the chain has no negative eigenvalues. For 
such chains, it is not necessary to make the chain lazy in order to investigate the 
mixing time. The lemma is designed to apply to heat-bath Markov chains. We 
• present two applications of this result: one to single-site heat-bath chains for spin 

systems and one to a heat-bath Markov chain for sampling contingency tables. 

O ; 1 The result and two applications 

m _ 

Suppose that fl is a finite set and let n : fl — > (0, 1] be a probability distribution on fl. 
Let £ be a nonempty finite index set and let L = \C\. Suppose that for all x G fl and 
^ | a G C we have a subset fl x>a of fl such that 

• x G fl x , a for all x G fl and a G £, and 

• for each a G C, the set {fl x , a '■ % G fl} forms a partition of fl. 
These conditions imply that for all x, y G fl and a G C, 

if y G fl x>a then fi^ = fl y>a . (1) 
Now suppose that Ai is a Markov chain on f2 with transition matrix P given by 

p =i:>> ( 2 ) 
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where, for each a G £, the \Q\ x \Q\ matrix P a (with rows and columns indexed by Q) 
is defined by 

(Here t(y G f^a) equals 1 if y G f^a, and equals otherwise.) Note that P a is well- 
defined for all a G £, since 7r is nonzero on all states and Q x>a is nonempty. 

If x G VL is the current state of the Markov chain Ai then a transition is performed 
by choosing an element a G £ uniformly at random, then sampling the next state y 
from Q xa with respect to the distribution tt restricted to £l x>a . (To make the connection 
with heat-bath chains, think of £ as the set of positions where updates may occur, and 
for a given 16O and a G £, the set Q x<a contains the possibilities for the next state.) 

When (T5]) holds it follows that Ai is aperiodic (since every state has a self-loop) and 
that Ai is reversible with respect to n. However, the chain Ai need not be irreducible. 
(See [7] for Markov chain definitions which are not given here.) 

Lemma 1. Let Ai be a Markov chain with finite state space Q and transition matrix 
P. Let 7r : Q — > (0, 1] be a probability distribution on Q. Suppose that P satisfies (EJ) for 
some index set £, sets Q XtCl and matrices P a . Then Ai has no negative eigenvalues. 

Proof. Recall that a matrix is positive semi-definite if it has no negative eigenvalues. 
By definition of P a we know that P a (x,y) — if y G" £l Xta . Furthermore ([1]) implies that 
if z G fl Xia then P a (x,y) = P a (z,y) for all y G Q. Therefore, for all x,y G Q and all 
a G £ we have 

P 2 a (x,y) = Y,Pa{x,z)P a (z,y)= P a (x , z) P a (z , y) = Pa(x,z)P a (x,y) 

= P a (x,y). 

This says that the matrix P a is a projection, so the only eigenvalues of P a are and 1. 
It follows that P a is positive semi-definite for all a G £. 

Then P is a nonnegative linear combination of positive semi-definite matrices, and it 
follows that P is positive semi-definite. (This can be proved directly from the variational 
characterisation of eigenvalues, or see for example P, Observation 7.1.3].) This completes 
the proof. □ 

Lemma [T] can be easily generalised to allow a non-uniform distribution over £, with 
the same proof. That is, we can replace (T5]) by 



P = ^p(a)P a , 



where p is a probability distribution on £. 
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1.1 Application: a single-site heat-bath chain for spin systems 



Let G = (V,E) be an arbitrary graph and let S be a finite set of spins (or colors). 
Consider the state space Q = S v and let tt be a probability distribution on Q. Given 
a G Q, for all v G V and k G S we define cr u,fc by 



(So <7 v ' fc is obtained from a by replacing the spin at v by fc.) The single-site heat-bath 
chain for Q is the Markov chain with transition matrix 



This matches the setting of Lemma [T] by choosing C = V and L = \V\, and defining 



for all a G f2 and v G V. Hence, by Lemma [Tj single-site heat-bath chains for general 



The heat-bath chain, which belongs to the family of Glauber dynamics (see for ex- 
ample [9]), has been studied by many authors including [5], [10]. In several cases, the 
continuous-time version of this Markov chain is considered. One advantage of this ap- 
proach is that mixing properties can be described solely by the second-largest eigenvalue. 
Thus, when translating these results to discrete time, it usually remains to bound the 
smallest eigenvalue of the chain. The last example shows that for the heat-bath chain, 
the established continuous-time bounds can be used without further analysis. In the 
case of the Potts model this argument was used in the proof of [12, Theorem 2.10]. 

Note that there are other Glauber dynamics, like the Metropolis chain, which are 
generally not guaranteed to have only non-negative eigenvalues. 

1.2 Application: a heat-bath chain for contingency tables 

Let r = (ri, . . . , r m ) and c = (ci, . . . , c„) be two vectors of positive integers with the 
same sum. A contingency table with row sums r and column sums c is an m x n matrix 
with nonnegative integer entries, such that the z'th row sum is and the j'th column 
sum is Cj, for i — 1, . . . , m and j = 1, . . . , n. Let Q r c denote the set of all contingency 
tables with row sums r and column sums c. 

Dyer and Greenhill [1] proposed a Markov chain for sampling contingency tables, 
which we will call the contingency chain. A transition of the chain is performed as 
follows: choose a 2 x 2 subsquare of the current table uniformly at random, then replace 
this 2x2 subsquare by a uniformly chosen 2x2 nonnegative integer matrix with the 





{a v ' k :keS} 



spin models do not have negative eigenvalues. 



o 



3 



same row and column sums. The lazy contingency chain does nothing at each step with 
probability |, and otherwise performs a transition as described above. Cryan et al. [1] 
analysed the lazy contingency chain for a constant number of rows and proved that it 
is rapidly mixing. 

To fit the contingency chain into the setting of Lemma [U let C be the set of all 
positions of 2 x 2 subsquares, and let L — \C\ — (™) uj- Let P a be the transition matrix 
of the Markov chain which acts only on the 2x2 subsquare a G C. Then P a (x, •) is 
uniform over all contingency tables y G f2 r)C which differ from x only within the 2x2 
subsquare a. Hence Lemma Q] applies and shows that the contingency chain has no 
negative eigenvalues. 

2 Implications for the mixing time 

Let M. be an ergodic, reversible Markov chain with finite state space Q, transition 
matrix P and stationary distribution tt. The eigenvalues of M. satisfy 

1 = A > Ai > A 2 > • • • > \ N -i > -1, 

where N = \ fl\. We refer to Ajv_i as the smallest eigenvalue of M.. (Note, this does not 
mean smallest in absolute value.) The connection between the mixing time of a Markov 
chain and its eigenvalues is well-known (see [HI Proposition 1]): 

r(e) <(1- K)- 1 \n^— (3) 

^ ^"min 

where r(e) denotes the mixing time of the Markov chain, 7r min = min^n 7r(x) and 

A* = max{Ai, |Ajv-i|}- 

When studying the mixing time of a Markov chain Ai, the approach which has become 
standard is to make the chain Ai lazy by replacing P by (J + P)/2, where / denotes the 
identity matrix. Then all eigenvalues of the lazy chain are nonnegative, and only the 
second-largest eigenvalue must be investigated. Clearly if P has no negative eigenvalues 
then A* = Ai and it is not necessary to make the chain lazy. 

Diaconis and Stroock [31 Proposition 2] gave a result which can be used to calculate 
an upper bound for (1 + Ajv-i) -1 for ergodic, reversible chains. A modified version 
was stated by Diaconis and Saloffe-Coste [21 p. 702]. In particular, if every state has a 
self-loop then 

(1 + Atv-i)^ 1 < \ max P(x, x)~ . 

For many combinatorial Markov chains, this approach is simple to apply and gives 
an upper bound on (1 + Aa^i)^ 1 which is several orders of magnitude smaller than 
the best-known bound on (1 — Ai) -1 . We feel that these results are under-utilised, 
and recommend them as an alternative to making Markov chains lazy (together with 
Lemma [U where it applies). 
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